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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Hillman, Jennifer L. 

Lai, Preeti 
Corley, Neil C. 
Shah , Purvi 

(ii) TITLE OF THE INVENTION: HUMAN CITRATE SYNTHASE HOMOLOG 

(iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

( E ) COUNTRY : USA 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/ DOCKET NUMBER: PF-0425 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-855-0555 

(B) TELEFAX: 650-845-4166 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: ENDCNOT 0 3 

(B) CLONE: 2171653 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
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Ser Gly 
465 

(2) INFORMATION FOR SEQ ID NO-.2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : ENDCNOT03 

(B) CLONE: 2171653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

CGCCGGTTCG TCTACTCTTT CCTTCAGCCG CCTCCTTTCA ACCTTGTCAA CCCGTCGGCG 60 

CGGCCTCTGG TGCAGCGGCG GCGGCTCCTG TTCCTGCCGC AGCTCTCTCC CTTTCTTACC 12 0 

TCCCCACCAG ATCCCGGAGA TCGCCCGCCA TGGCTTTACT TACTGCGGCC GCCCGGCTCT 18 0 

TGGGAACCAA GAATGCATCT TGTCTTGTTC TTGCAGCCCG GCATGCCAGT GCTTCCTCCA 24 0 

CGAATTTGAA AGACATATTG GCTGACCTGA TACCTAAGGA GCAGGCCAGA ATTAAGACTT 300 

TCAGGCAGCA ACATGGCAAG ACGGTGGTGG GCCAAATCAC TGTGGACATG ATGTATGGTG 3 60 

GCATGAGAGG CATGAAGGGA TTGGTCTATG AAACATCAGT TCTTGATCCT GATGAGGGCA 42 0 

TCCGTTTCCG AGGCTTTAGT ATCCCTGAAT GCCAGAAACT GCTACCCAAG GCTAAGGGTG 48 0 

GGGAAGAACC CCTGCCTGAG GGCTTATTTT GGCTGCTGGT AACTGGACAT ATCCCAACAG 540 

AGGAACAGGT ATCTTGGCTC TCAAAAGAGT GGGCAAAGAG GGCAGCTCTG CCTTCCCATG 600 

TGGTCACCAT GCTGGACAAC TTTCCCACCA ATCTACACCC CATGTCTCAG CTCAGTGCAG 660 

CTGTTACAGC CCTCAACAGT GAAAGTAACT TTGCCCGAGC ATATGCACAG GGTATCAGCC 72 0 

GAACCAAGTA CTGGGAGTTG ATTTATGAAG ACTCTATGGA TCTAATCGCA AAGCTAC CTT 78 0 

GTGTTGCAGC AAAGATCTAC CGAAATCTCT ACAGAGAAGG CAGCGGTATT GGGGCCATTG 840 

ACTCTAACCT GGACTGGTCT CACAATTTCA CCAACATGTT AGGCTATACT GATCATCAGT 900 

TCACTGAGCT CACGCGCCTG TACCTCACCA TCCACAGTGA CCATGAGGGT GGCAATGTAA 960 

GTGCCCATAC CAGCCATTTG GTGGGCAGTG CCCTTTCCGA CCCTTACCTG TCCTTTGCAG 102 0 

CAGCCATGAA CGGGCTGGCA GGGCCTCTCC ATGGACTGGC AAATCAGGAA GTGCTTGTCT 108 0 

GGCTAACACA GCTGCAGAAG GAAGTTGGCA AAGATGTGTC AGATGAGAAG TTACGAGACT 1140 

ACATCTGGAA CACACTCAAC TCAGGACGGG TTGTTCCAGG CTATGGCCAT GCAGTACTAA 1200 

GGAAGACTGA TCCGCGATAT ACCTGTCAGC GAGAGTTTGC TCTGAAACAC CTGCCTAATG 12 60 

ACCCCATGTT TAAGTTGGTT GCTCAGCTGT ACAAGATTGT GCCCAATGTC CTCTTAGAGC 132 0 

AGGGTAAAGC CAAGAATCCT TGGCCCAATG TAGATGCTCA CAGTGGGGTG CTGCTCCAGT 138 0 

ATTATGGCAT GACGGAGATG AATTACTACA CGGTCCTGTT TGGGGTGTCA CGAGCATTGG 1440 

GTGTACTGGC ACAGCTCATC TGGAGCCGAG CCTTAGGCTT CCCTCTAGAA AGGCC CAAGT 150 0 

CCATGAGCAC AGAGGGTCTG ATGAAGTTTG TGGACTCTAA GTCAGGGTAA AACTGGAGAC 1560 

TGGGTGAAAG TGACTACCAG AAAGTGAGGA AGCCTAAATA AAAAGTATAC TTTTGTTTCA 162 0 

GGGGGCCTTT AAAGACTTAA GATTAAATTA TATCTGAGGC ACTGATAATA TGTTTGAGGT 168 0 

TAAAATATAA ATTAAGACTT TAAAAGATGA AAAATGGTCC CTTCTTCCCT AATCAGCTCC 1740 

CTTCCCCTGC CTGGTATGAG TTGCCCATCA TACGCATGGT CCTGGAGGAT GACCAGGACT 1800 

AATGCATGTG GTATGAGTAG GTTTGGCCCC CTCACTATCT C TAGAGTG AG AATCTGGCTC 1860 

CTGTTTCCAT GGGTCAAAGC CGGTTGCAGA GAATC TGTAG TCACTTTGGA GCTTTAGCTT 192 0 

CTCTGCCAAG CCCTCAATAA GCCAGCAAAC CAGGACTCTG CCCCTTCTGT TTCCATAGGA 198 0 

ATCATGTTGG ATAGTCAGCT GTACCAAGCC CCTTGGCCCT CTCCCATGCA CACAAACACC 2040 

TCCTAGCAAG ACCTGTTGGT TAGCTGGACA TGC TTTGGC A ATTTTTTTAT ACTACCAAGT 2100 

GACCATATTG GCATGGCATT TTTTGGTGAT G 2131 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 464 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 164419 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Ala Leu Leu Thr Ala Ala Ala Arg Leu Phe Gly Ala Lys Asn Ala 

15 10 15 

Ser Cys Leu Val Leu Ala Ala Arg His Ala Ser Ala Ser Ser Thr Asn 

20 25 30 

Leu Lys Asp He Leu Ala Asp Leu He Pro Lys Glu Gin Ala Arg He 

35 40 45 

Lys Thr Phe Arg Gin Gin His Gly Asn Thr Val Val Gly Gin He Thr 

50 55 60 

Val Asp Met Met Tyr Gly Gly Met Arg Gly Met Lys Gly Leu Val Tyr 
65 ' 70 75 80 

Glu Thr Ser Val Leu Asp Pro Asp Glu Gly He Arg Phe Arg Gly Tyr 

85 90 95 

Ser He Pro Glu Cys Gin Lys Met Leu Pro Lys Ala Lys Gly Gly Glu 

100 105 110 

Glu Pro Leu Pro Glu Gly Leu Phe Trp Leu Leu Val Thr Gly Gin He 

115 " 120 125 

Pro Thr Glu Glu Gin Val Ser Trp Leu Ser Lys Glu Trp Ala Lys Arg 

130 135 140 

Ala Ala Leu Pro Ser His Val Val Thr Met Leu Asp Asn Phe Pro Thr 
145 150 155 160 

Asn Leu His Pro Met Ser Gin Leu Ser Ala Ala He Thr Ala Leu Asn 

165 170 175 

Ser Glu Ser Asn Phe Ala Arg Ala Tyr Ala Glu Gly He His Arg Thr 

180 185 190 

Lys Tyr Trp Glu Leu He Tyr Glu Asp Cys Met Asp Leu He Ala Lys 

195 200 205 

Leu Pro Cys Val Ala Ala Lys He Tyr Arg Asn Leu Tyr Arg Glu Gly 

210 ~ 215 220 

Ser Ser He Gly Ala He Asp Ser Lys Leu Asp Trp Ser His Asn Phe 
225 230 235 240 

Thr Asn Met Leu Gly Tyr Thr Asp Ala Gin Phe Thr Glu Leu Met Arg 

245 250 255 

Leu Tyr Leu Thr He His Ser Asp His Glu Gly Gly Asn Val Ser Ala 

260 265 270 

His Thr Ser His Leu Val Gly Ser Ala Leu Ser Asp Pro Tyr Leu Ser 

275 280 285 

Phe Ala Ala Ala Met Asn Gly Leu Ala Gly Pro Leu His Gly Leu Ala 

290 295 300 

Asn Gin Glu Val Leu Val Trp Leu Thr Gin Leu Gin Lys Glu Val Gly 
305 310 315 320 

Lys Asp Val Ser Asp Glu Lys Leu Arg Asp Tyr He Trp Asn Thr Leu 

325 33 0 335 

Asn Ser Gly Arg Val Val Pro Gly Tyr Gly His Ala Val Leu Arg Lys 

340 345 350 

Thr Asp Pro Arg Tyr Thr Cys Gin Arg Glu Phe Ala Leu Lys His Leu 

355 " ~ ~ 360 365 

Pro His Asp Pro Met Phe Lys Leu Val Ala Gin Leu Tyr Lys He Val 

370 " 375 380 

Pro Asn Val Leu Leu Glu Gin Gly Lys Ala Lys Asn Pro Trp Pro Asn 
385 390 395 400 

Val Asp Ala His Ser Gly Val Leu Leu Gin Tyr Tyr Gly Met Thr Glu 
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Met Asn Tyr Tyr 
420 

Leu Ala Gin Leu 
435 

Pro Lys Ser Met 
450 



405 

Thr Val Leu Phe 

lie Trp Ser Arg 
440 

Ser Thr Asp Gly 
455 



410 

Gly Val Ser Arg 
425 

Ala Leu Gly Phe 

Leu lie Lys Leu 
460 



415 

Ala Leu Gly Val 
430 

Pro Leu Glu Arg 
445 

Val Asp Ser Lys 



